PAGE: 1 RAW SEQUENCE -LISTING ' DATE: 02/10/1999 

PATENT APPLICATION US/08/240,410 TIME: 15:52:36 



INPUT SET: S30581.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



1 SEQUENCE LISTING 
2 

3 (1) General Information r« ^ . __ _^ 

^ ENTERFf 

5 / APPLICANT: MICHALOVICH; DAVID H It 

6 HAYES, PHILIP DAVID 

8 (ii) TITLE OF THE INVENTION: NOVEL COMPOUNDS 
9 
10 

11 (iii) NUMBER OF SEQUENCES: 4 ' 

13 (iv) CORRESPONDENCE ADDRESS: -Jl rr\ 

14 (A) ADDRESSEE: Ratner & Prestia g r' "O 

15 (B) STREET: P.O. Box 980 ^ ^ 

16 (C) CITY: Valley Forge ^ ^ 

17 (D) STATE: PA p: ^ pl 

18 (E) COUNTRY: USA :X3 ^ 
. 19 (F) ZIP: 19482 O 

20 e 

21 (v) COMPUTER READABLE FORM: 

22 (A) MEDIUM TYPE: Diskette 

23 (B) COMPUTER: IBM Compatible 

24 (C) OPERATING SYSTEM: DOS 

25 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
26 

27 (vi) CURRENT APPLICATION DATA: 

--> 28 (A) APPLICATION NUMBER: TO BE ASSIGNED 

29 (B) FILING DATE: 27-JAN-1999 

--> 30 (C) CLASSIFICATION: UNKNOWN 

31 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: UK APPLICATION NO. TO BE ASSIGNED 

34 (B) FILING DATE: 20-JAN-1999 
35 

36 (A) APPLICATION NUMBER: EP APPLICATION NO. 98300694.1 

37 (B) FILING DATE: 30-JAN-1998 
38 

39 

40 (viii) ATTORNEY/ AGENT INFORMATION: 

41 (A) NAME: Prestia, Paul F 

42 (B) REGISTRATION NUMBER: 23,031 

43 (C) REFERENCE /DOCKET NUMBER: GP-3 003 9 
44 

45 (ix) TELECOMMUNICATION INFORMATION: 

46 (A) TELEPHONE: 610-407-0700 
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47 (B) TELEFAX: 610-407-0700 

48 (C) TELEX: 846169 
49 

50 

51 (2) INFORMATION FOR SEQ ID NO : 1 : 

52 ^ 

53 (i) SEQUENCE CHARACTERISTICS: O ^ 

54 (A) LENGTH: 2010 base pairs 

55 (B) TYPE: nucleic acid ^ '\ 

56 (C) STRANDEDNESS : single O ^ fH 

57 (D) TOPOLOGY: linear CO "^^L 

58 ^ ^ 

59 (ii) MOLECULE TYPE: CDNA ^ X§ C3 

60 S ^ 

61 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : O 

62 'X 

63 ATGACGCCTC CTCCGCCCGG ACGTGCCGCC CCCAGCGCAC CGCGCGCCCG CGTCCCTGGC 60 

64 CCGCCGGCTC GGTTGGGGCT TCCGCTGCGG CTGCGGCTGC TGCTGCTGCT CTGGGCGGCC 120 

65 GCCGCCTCCG CCCAGGGCCA CCTAAGGAGC GGACCCCGCA TCTTCGCCGT CTGGAAAGGC 180 

66 CATGTAGGGC AGGACCGGGT GGACTTTGGC CAGACTGAGC CGCACACGGT GCTTTTCCAC 240 

67 GAGCCAGGCA GCTCCTCTGT GTGGGTGGGA GGACGTGGCA AGGTCTACCT CTTTGACTTC 300 

68 CCCGAGGGCA AGAACGCATC TGTGCGCACG GTGAATATCG GCTCCACAAA GGGGTCCTGT 360 

69 CTGGATAAGC GGGACTGCGA GAACTACATC ACTCTCCTGG AGAGGCGGAG TGAGGGGCTG 42 0 

70 CTGGCCTGTG GCACCAACGC CCGGCACCCC AGCTGCTGGA ACCTGGTGAA TGGCACTGTG 480 

71 GTGCCACTTG GCGAGATGAG AGGCTACGCC CCCTTCAGCC CGGACGAGAA CTCCCTGGTT 540 

72 CTGTTTGAAG GGGACGAGGT GTATTCCACC ATCCGGAAGC AGGAATACAA TGGGAAGATC 600 

73 CCTCGGTTCC GCCGCATCCG GGGCGAGAGT GAGCTGTACA CCAGTGATAC TGTCATGCAG 660 

74 AACCCACAGT TCATCAAAGC CACCATCGTG CACCAAGACC AGGCTTACGA TGACAAGATC 720 

75 TACTACTTCT TCCGAGAGGA CAATCCTGAC AAGAATCCTG AGGCTCCTCT CAATGTGTCC 780 

76 CGTGTGGCCC AGTTGTGCAG GGGGGACCAG GGTGGGGAAA GTTCACTGTC AGTCTCCAAG 840 

77 TGGAACACTT TTCTGAAAGC CATGCTGGTA TGCAGTGATG CTGCCACCAA CAAGAACTTC 900 

78 AACAGGCTGC AAGACGTCTT CCTGCTCCCT GACCCCAGCG GCCAGTGGAG GGACACCAGG 960 

79 GTCTATGGTG TTTTCTCCAA CCCCTGGAAC TACTCAGCCG TCTGTGTGTA TTCCCTCGGT 102 0 

80 GACATTGACA AGGTCTTCCG TACCTCCTCA CTCAAGGGCT ACCACTCAAG CCTTCCCAAC 1080 

81 CCGCGGCCTG GCAAGTGCCT CCCAGACCAG CAGCCGATAC CCACAGAGAC CTTCCAGGTG 1140 

82 GCTGACCGTC ACCCAGAGGT GGCGCAGAGG GTGGAGCCCA TGGGGCCTCT GAAGACGCCA 1200 

83 TTGTTCCACT CTAAATACCA CTACCAGAAA GTGGCCGTCC ACCGCATGCA AGCCAGCCAC 1260 

84 GGGGAGACCT TTCATGTGCT TTACCTAACT ACAGACAGGG GCACTATCCA CAAGGTGGTG 1320 

85 GAACCGGGGG AGCAGGAGCA CAGCTTCGCC TTCAACATCA TGGAGATCCA GCCCTTCCGC 1380 

86 CGCGCGGCTG CCATCCAGAC CATGTCGCTG GATGCTGAGC GGAGGAAGCT GTATGTGAGC 1440 

87 TCCCAGTGGG AGGTGAGCCA GGTGCCCCTG GACCTGTGTG AGGTCTATGG CGGGGGCTGC 1500 

88 CACGGTTGCC TCATGTCCCG AGACCCCTAC TGCGGCTGGG ACCAAGGCCG CTGCATCTCC 1560 

89 ATCTACAGCT CCGAACGGTC AGTGCTGCAA TCCATTAATC CAGCCGAGCC ACACAAGGAG 1620 

90 TGTCCCAACC CCAAACCAGA CAAGGCCCCA CTGCAGAAGG TTTCCCTGGC CCCAAACTCT 1680 

91 CGCTACTACC TGAGCTGCCC CATGGAATCC CGCCACGCCA CCTACTCATG GCGCCACAAG 1740 

92 GAGAACGTGG AGCAGAGCTG CGAACCTGGT CACCAGAGCC CCAACTGCAT CCTGTTCATC 1800 

93 GAGAACCTCA CGGCGCAGCA GTACGGCCAC TACTTCTGCG AGGCCCAGGA GGGCTCCTAC 1860 

94 TTCCGCGAGG CTCAGCACTG GCAGCTGCTG CCCGAGGACG GCATCATGGC CGAGCACCTG 1920 

95 CTGGGTCATG CCTGTGCCCT GGCCGCCTCC CTCTGGCTGG GGGTGCTGCC CACACTCACT 1980 

96 CTTGGCTTGC TGGTCCACTA GGGCCTCCCG 2010 
97 

98 (2) INFORMATION FOR SEQ ID NO : 2 : 

99 
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100 




(i) SEQUENCE CHARACTERISTICS: 












101 






(A) 


LENGTH : 


666 


amino acids 












102 






(B) 


TYPE : amino 


acid 














103 






(C) 


STRANDEDNESS : s: 


Lngle 












104 






(D) 


TOPOLOGY: linear 














105 






























106 
107 




(ii) MOLECULE 


TYPE : protein 












108 
109 




(xi) SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 2: 






110 


Met 


Thr 


Pro 


Pro 


Pro 


Pro 


Gly 


Arg 


Ala 


Ala 


Pro 


Ser Ala Pro 


Arg 


Ala -J % 


111 


1 








5 










10 






15 




112 






























113 


Arg 


Val 


Pro 


Gly 


Pro 


Pro 


Ala 


Arg 


Leu 


Gly 


Leu 


Pro Leu Arg 


Leu 


Arg ^ ^ 


114 








20 










25 






30 




O 


115 


Leu 


Leu 


Leu 


Leu 


Leu 


Trp 


Ala 


Ala 


Ala 


Ala 


Ser 


Ala Gin Gly 


His 


Leu 


116 






35 










40 








45 






117 


Arg 


Ser 


Gly 


Pro 


Arg 


He 


Phe 


Ala 


Val 


Trp 


Lys 


Gly His Val 


Gly 


Gin 


118 




50 










55 










60 






119 


Asp 


Arg 


Val 


Asp 


Phe 


Gly 


Gin 


Thr 


Glu 


Pro 


His 


Thr Val Leu 


Phe 


His 


120 


65 










70 










75 






80 


121 


Glu 


Pro 


Gly 


Ser 


Ser 


Ser 


Val 


Trp 


Val 


Gly 


Gly 


Arg Gly Lys 


Val 


Tyr 


122 










85 










90 






95 




123 


Leu 


Phe 


Asp 


Phe 


Pro 


Glu 


Gly 


Lys 


Asn 


Ala 


Ser 


Val Arg Thr 


Val 


Asn 


124 








100 










105 






110 






125 


lie 


Gly 


Ser 


Thr 


Lys 


Gly 


Ser 


Cys 


Leu 


Asp 


Lys 


Arg Asp Cys 


Glu 


Asn 


126 






115 










120 








125 






127 


Tyr 


He 


Thr 


Leu 


Leu 


Glu 


Arg 


Arg 


Ser 


Glu 


Gly 


Leu Leu Ala 


Cys 


Gly 


128 




130 










135 










140 






129 


Thr 


Asn 


Ala 


Arg 


His 


Pro 


Ser 


Cys 


Trp 


Asn 


Leu 


Val Asn Gly 


Thr 


Val 


130 


145 










150 










155 






160 


131 


Val 


Pro 


Leu 


Gly 


Glu 


Met 


Arg 


Gly 


Tyr 


Ala 


Pro 


Phe Ser Pro 


Asp 


Glu 


132 










165 










170 






175 




133 


Asn 


Ser 


Leu 


Val 


Leu 


Phe 


Glu 


Gly 


Asp 


Glu 


Val 


Tyr Ser Thr 


He 


Arg 


134 








180 










185 






190 






135 


Lys 


Gin 


Glu 


Tyr 


Asn 


Gly 


Lys 


He 


Pro 


Arg 


Phe 


Arg Arg He 


Arg 


Gly 


136 






195 










200 








205 






137 


Glu 


Ser 


Glu 


Leu 


Tyr 


Thr 


Ser 


Asp 


Thr 


val 


Met 


Gin Asn Pro 


Gin 


Phe 


138 




210 










215 










220 






139 


lie 


Lys 


Ala 


Thr 


He 


Val 


His 


Gin 


Asp 


Gin 


Ala 


Tyr Asp Asp 


Lys 


He 


140 


225 










230 










235 






240 


141 


Tyr 


Tyr 


Phe 


Phe 


Arg 


Glu 


Asp 


Asn 


Pro 


Asp 


Lys 


Asn Pro Glu 


Ala 


Pro 


142 










245 










250 






255 




143 


Leu 


Asn 


Val 


Ser 


Arg 


Val 


Ala 


Gin 


Leu 


Cys 


Arg 


Gly Asp Gin 


Gly 


Gly 


144 








260 










265 






270 






145 


Glu 


Ser 


Ser 


Leu 


Ser 


Val 


Ser 


Lys 


Trp 


Asn 


Thr 


Phe Leu Lys 


Ala 


Met 


146 






275 










280 








285 






147 


Leu 


Val 


Cys 


Ser 


Asp 


Ala 


Ala 


Thr 


Asn 


Lys 


Asn 


Phe Asn Arg 


Leu 


Gin 


148 




290 










295 










300 






149 


Asp 


Val 


Phe 


Leu 


Leu 


Pro 


Asp 


Pro 


Ser 


Gly 


Gin 


Trp Arg Asp 


Thr 


Arg 


150 


305 










310 










315 






320 


151 


Val 


Tyr 


Gly 


Val 


Phe 


Ser 


Asn 


Pro 


Trp 


Asn 


Tyr 


Ser Ala Val 


Cys 


Val 


152 










325 










330 






335 





"a 
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153 


Tyr Ser Leu Gly 


Asp 


He 


Asp 


Lys 


Val 


Phe 


Arg 


Thr 


Ser 


Ser 


Leu 


Lys 


154 




340 










345 










350 






155 


Gly Tyr His 


Ser 


Ser 


Leu 


Pro 


Asn 


Pro 


Arg 


Pro 


Gly Lys 


Cys 


Leu 


Pro 


156 


355 










360 










365 








157 


Asp Gin Gin 


Pro 


He 


Pro 


Thr 


Glu 


Thr 


Phe 


Gin 


Val 


Ala 


Asp 


Arg 


His 


158 


370 








375 










380 










159 


Pro Glu Val 


Ala 


Gin 


Arg 


Val 


Glu 


Pro 


Met 


Gly 


Pro 


Leu 


Lys 


Thr 


Pro 


160 


385 






390 










3 95 










400 


161 


Leu Phe His 


Ser 


Lys 


Tyr 


His 


Tyr 


Gin 


Lys 


Val 


Ala 


val 


His 


Arg 


Met 


162 






405 










410 










415 




163 


Gin Ala Ser 


His 


Gly 


Glu 


Thr 


Phe 


His 


Val 


Leu 


Tyr 


Leu 


Thr 


Thr 


Asp 


164 




420 










425 










430 






165 


Arg Gly Thr 


He 


His 


Lys 


Val 


Val 


Glu 


Pro 


Gly 


Glu 


Gin 


Glu 


His 


Ser 


166 


435 










440 










445 








167 


Phe Ala Phe 


Asn 


He 


Met 


Glu 


He 


Gin 


Pro 


Phe 


Arg Arg Ala Ala 


Ala 


168 


450 








455 










460 










169 






























170 


He Gin Thr 


Met 


Ser 


Leu 


Asp 


Ala 


Glu 


Arg 


Arg 


Lys 


Leu 


Tyr 


Val 


Ser 


171 


465 






470 










475 










480 


172 


Ser Gin Trp Glu 


Val 


Ser 


Gin 


Val 


Pro 


Leu 


Asp 


Leu 


Cys 


Glu 


Val 


Tyr 


173 






485 










490 










495 




174 


Gly Gly Gly Cys 


His 


Gly 


Cys 


Leu 


Met 


Ser 


Arg 


Asp 


Pro 


Tyr 


Cys 


Gly 


175 




500 










505 










510 






176 


Trp Asp Gin Gly 


Arg 


Cys 


He 


Ser 


He 


Tyr 


Ser 


Ser 


Glu Arg 


Ser 


Val 


177 


515 










520 










525 








178 


Leu Gin Ser 


He 


Asn 


Pro 


Ala 


Glu 


Pro 


His 


Lys 


Glu 


Cys 


Pro 


Asn 


Pro 


179 


530 








535 










540 










180 


Lys Pro Asp 


Lys 


Ala 


Pro 


Leu 


Gin 


Lys 


Val 


Ser 


Leu 


Ala 


Pro 


Asn 


Ser 


181 


545 






550 










555 










560 


182 


Arg Tyr Tyr 


Leu 


Ser 


Cys 


Pro 


Met 


Glu 


Ser 


Arg 


His 


Ala 


Thr 


Tyr 


Ser 


183 






565 










570 










575 




184 


Trp Arg His 


Lys 


Glu 


Asn 


Val 


Glu 


Gin 


Ser 


Cys 


Glu Pro Gly His 


Gin 


185 




580 










585 










590 






186 


Ser Pro Asn 


Cys 


He 


Leu 


Phe 


He 


Glu 


Asn 


Leu 


Thr 


Ala 


Gin 


Gin 


Tyr 


187 


595 










600 










605 








188 


Gly His Tyr 


Phe 


Cys 


Glu 


Ala 


Gin 


Glu 


Gly 


Ser 


Tyr 


Phe 


Arg 


Glu 


Ala 


189 


610 








615 










620 










190 


Gin His Trp 


Gin 


Leu 


Leu 


Pro 


Glu 


Asp 


Gly 


He 


Met 


Ala 


Glu 


His 


Leu 


191 


625 






630 










635 










640 


192 


Leu Gly His Ala 


Cys 


Ala 


Leu 


Ala 


Ala 


Ser 


Leu 


Trp Leu Gly Val 


Leu 


193 






645 










650 










655 




194 


Pro Thr Leu 


Thr 


Leu 


Gly 


Leu 


Leu 


Val 


His 














195 




660 










665 
















196 






























197 


(2) 


INFORMATION FOR SEQ ID 


NO: 3 














198 






























199 


(i) SEQUENCE CHARACTERISTICS: 
















200 


(A) 


LENGTH : 


712 


base pairs 
















201 


(B) 


TYPE: nucleic acid 


















202 


(C) 


STRANDEDNESS 


1: single 
















203 


(D) 


TOPOLOGY : 1 inear 


















204 






























205 


(ii) MOLECULE 


TYPE: CDNA 
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206 

207 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

208 

209 CCGCCTGCCG CCCAGGGCCA CCTAAGGAGC GGATNCTANN TCTTCGCCGT CTGGAAAGGC 60 

210 CATGTAGGGC AGGACCGGGT GGACTTTGGC CAGACTGAGC CGCACACGGT GCTTTTCCAC 120 

211 GAGCCAGGCA GCTCCTCTGT GTGGGTGGGA GGACGTGGCA AGGTCTACCT CTTTGACTTC 180 

212 CCCGAGGGCA AGAACGCATC TGTGCGCACG GTGAATATCG GCTCCACAAA GGGGTCCTGT 24 0 

213 CTGGATAAGC GGGACTGCGA GAACTACATC ACTCTCCTGG AGAGGCGGAG TGAGGGGCTG 3 00 

214 CTGGCCTGTG GCACCAACGC CCGGCACCCC AGCTGCTGGA ACCTGGTGAA TGCACTGTGG 360 

215 TGCCACCTTG GCGAGAGTGG AGGCTACGCC CCCTTCAGCC CGGACGAGAA CGTCCCGTGG 420 

216 TTCTGTTTTG AAGGGGACGA AGTGTATTCC ACCATCCGGA AAGCAAGGAA TTACAATTGG 480 

217 GAAGATCCTC GGTTCCGCCG CATCCGGGGC GAGAGTGAGC TGTACACCAG TGATACTGTC 540 

218 ATGCAGAACC CACAGTTCAT CAAAGCCACC ATCGTGCACC AAGACCAGGC TTACGATGAC 600 

219 AAGATCTACT ACTTCTTCCG AGAGGACAAT CCTGACAAGA ATCCTGAGGC TCCTCTCAAT 660 

220 GTGTCCCGTG TGGCCCAGTT GTGCAGGGGG GACCAGGGTG GGGAAAGTTC AN 712 
221 

.222 (2) INFORMATION FOR SEQ ID N0:4: 

223 

224 (i) SEQUENCE CHARACTERISTICS: 

225 (A) LENGTH: 215 amino acids 

226 (B) TYPE: amino acid 

227 (C) STRANDEDNESS : single 

228 (D) TOPOLOGY: linear 
229 

23 0 (ii) MOLECULE TYPE: protein 

231 

232 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

233 



234 


Gly Gin 


Asp 


Arg 


Val 


Asp 


Phe 


Gly 


Gin 


Thr 


Glu 


Pro 


His 


Thr 


Val 


Leu 


235 


1 








5 










10 










15 




236 


Phe 


His 


Glu 


Pro 


Gly 


Ser 


Ser 


Ser 


Val 


Trp 


Val 


Gly 


Gly 


Arg 


Gly Lys 


237 








20 










25 










30 






238 


Val 


Tyr 


Leu 


Phe 


Asp 


Phe 


Pro 


Glu 


Gly 


Lys 


Asn 


Ala 


Ser 


Val 


Arg 


Thr 


239 






35 










40 










45 








240 


Val 


Asn 


He 


Gly 


Ser 


Thr 


Lys 


Gly 


Ser 


Cys 


Leu 


Asp 


Lys 


Arg 


Asp 


Cys 


241 




50 










55 










60 










242 


Glu 


Asn 


Tyr 


He 


Thr 


Leu 


Leu 


Glu 


Arg 


Arg 


Ser 


Glu 


Gly 


Leu 


Leu 


Ala 


243 


65 










70 










75 










80 


244 


Cys 


Gly 


Thr 


Asn 


Ala 


Arg 


His 


Pro 


Ser 


Cys 


Trp 


Asn 


Leu 


Val 


Asn 


Ala 


245 










85 










90 










95 




246 


Leu 


Trp 


Cys 


His 


Leu 


Gly 


Glu 


Ser 


Gly 


Gly 


Tyr 


Ala 


Pro 


Phe 


Ser 


Pro 


247 








100 










105 










110 






248 


Asp 


Glu 


Asn 


Val 


Pro 


Trp 


Phe 


Cys 


Phe 


Glu 


Gly 


Asp 


Glu 


Val 


Tyr 


Ser 


249 






115 










120 










125 








250 


Thr 


He 


Arg 


Lys 


Ala 


Arg 


Asn 


Tyr 


Asn 


Trp 


Glu 


Asp 


Pro 


Arg 


Phe 


Arg 


251 




130 










135 










140 










252 


Arg 


He 


Arg 


Gly 


Glu 


Ser 


Glu 


Leu 


Tyr 


Thr 


Ser 


Asp 


Thr 


val 


Met 


Gin 


253 


145 










150 










155 










160 


254 


Asn 


Pro 


Gin 


Phe 


He 


Lys 


Ala 


Thr 


He 


Val 


His 


Gin 


Asp 


Gin 


Ala 


Tyr 


255 










165 










170 










175 




256 


Asp Asp 


Lys 


He 


Tyr 


Tyr 


Phe 


Phe 


Arg 


Glu 


Asp 


Asn 


Pro 


Asp 


Lys 


Asn 


257 








180 
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258 


Pro 


Glu 


Ala 


Pro 


Leu 


Asn 


Val 
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Arg 


Val 


Ala 


Gin 


Leu 


Cys 


Arg Gly 
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Line Error 

28 Wrong application Serial Number 

30 Wrong Classification 



Original Text 

(A) APPLICATION NUMBER: TO BE ASSIGNED 
(C) CLASSIFICATION: UNKNOWN 



